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Abstract 

Background: A leading contributing factor to firefighter injury and death is lack of fitness. Therefore, the Fire 
Service Joint Labor Management Wellness-Fitness Initiative (WFI) was established that includes a focus on providing 
fitness assessments to all fire service personnel. The current fitness assessment includes a submaximal exercise test 
protocol and associated prediction equation to predict individual V02peak as a measure of fitness. There is limited 
information on the accuracy, precision, and sources of error of this prediction equation. This study replicated previous 
research by validating the accuracy of the WFI V02peak prediction equation for a group of firefighters and further 
examining potential sources of error for an individual firefighters' assessment. 

Methods: The sample consisted of 22 firefighters who completed a maximal exercise test protocol similar to the WFI 
submaximal protocol, but the test was terminated when firefighters reached a maximal level of exertion (i.e., measured 
V02peak)- We then calculated the predicted V02peak based on the WFI prediction equation along with individual 
firefighters' body mass index (BMI) and 85% of maximum heart rate. The data were analyzed using paired samples 
t-tests in SPSS v. 21.0. 

Results: The difference between predicted and measured V02peak was -0.77 ± 8.35 mL-kg"^-min"\ However, there was a 
weak, statistically non-significant association between measured V02peak and predicted V02peak (R^ = 0.09, F(l,21) = 2.05, 
p = 0.1 7). The intraclass correlation coefficient (ICC = 0.21 5, p > 0.05) and Pearson (r=0.31,p = 0.17) and Spearman 
(p = 0.28, p = 0.21) correlation coefficients were small. The standard error of the estimate (SEE) was 8.5 mL«kg"^«min"\ 
Further, both age and baseline fitness level were associated with increased inaccuracy of the prediction equation. 

Conclusions: We provide data on the inaccuracy and sources of error for the WFI V02peak prediction equation for 
predicting fitness level in individual firefighters, despite apparently accurate predictions for a group of firefighters. These 
results suggest that the WFI prediction equation may need to be reevaluated as a means of precisely determining 
fitness for individual firefighters, which may affect employment status, duty assignment, and overall life safety of the 
firefighter. 
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Firefighting is an occupation that requires individuals to 
work in demanding and often times physically and psy- 
chologically stressful conditions [1-6]. Successful and safe 
job performance requires firefighters to maintain, among 
other critical factors, a high level of aerobic capacity (i.e., 
fitness). One of the leading contributing factors to fire- 
fighter injuries is lack of fitness [7]. Sudden cardiac death 
also accounts for close to half of all on-duty firefighter 
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fatalities in the United States [8]. This cause of mortality 
has been linked, in part, to fitness level [9-12]. 

Accordingly, the International Association of Firefighters 
(lAFF) and the International Association of Fire Chiefs 
(lAFC) established the Fire Service Joint Labor Manage- 
ment Wellness-Fitness Initiative (WI) in 1997 [13]. The 
WFI includes a focus on fitness assessments for all fire ser- 
vice personnel - a firefighter is recommended to be at or 
above a minimal level of fitness indicative of the ability to 
successfully and safely perform firefighting duties. 

The gold standard for measurement of cardiorespiratory 
fitness is a test of peak oxygen consumption (V02peak) in 
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the laboratory through open circuit spirometry [14]. How- 
ever, this test requires expensive equipment, extensive pro- 
fessional expertise, and may require physician supervision 
[15]. An alternative method involves predicting V02peai< 
using a submaximal exercise test and validated equation. 
Both the revised 2008 edition of the WFI and 2013 
National Fire Protection Association (NFPA) 1582 standard 
medical program include a submaximal exercise test proto- 
col to predict a firefighters V02peai< [16,17]. This submaxi- 
mal exercise test is based on the Gerkin treadmill protocol 
which involves a warm-up of three minutes at 3 miles-per- 
hour (mph) followed by increases in ramp incline by 2% or 
speed by 0.5-mph every minute (i.e.. Stage 1: 4.5-mph and 
0% incline; Stage 2: 4.5-mph and 2% incline; Stage 3: 5.0- 
mph and 2% incline; Stage 4: 5.0-mph and 4% incline; Stage 
5: 5.5-mph and 4% incline; Stage 6: 5.5-mph and 6% incline; 
etc). The test is terminated when the participant reaches 
85% of estimated maximum heart rate, based on the Tanaka 
formula ((208 - (0.7 x age)) x 0.85) [18]. The predicted 
V02peai< value is then calculated from the test time (TT) re- 
quired to achieve 85% of maximum heart rate and Body 
Mass Index (BMI) of the participant. 

Previous research has assessed the accuracy of the 2008- 
revised WFI assessment for predicting V02peai< [19]. That 
study compared data from 63 male firefighters who per- 
formed both submaximal and maximal WFI exercise tests 
with expired gases analyzed by a CardioCoach CO2™ port- 
able metabolic system during the maximal test. Data ana- 
lysis (i.e., ^-test) demonstrated no statistically significant 
difference between the predicted and measured V02peak 
values, suggesting that V02peai< values from the submaxi- 
mal protocol accurately reflect directly measured V02peak- 
This result was deemed to be an improvement over the 
previous version of the WFI protocol that utilized different 
means of determining maximum heart rate (220-age) and 
the ACSM metabolic equation for running to predict 
V02peak- The previously accepted approach has consist- 
ently over predicted aerobic capacity and is no longer 
recommended in predicting V02peak in individual fire- 
fighters [20]. 

The purpose of our study was to replicate previous re- 
search by cross-validating the WFI V02peai< prediction 
equation. We further aimed to identify potential sources of 
error that may influence the accuracy of the prediction. 
We are unaware of research examining sources of error in 
estimation (i.e., participant age or fitness level) using the 
2008-revised WFI equation. Lastly, we assessed the classifi- 
cation accuracy of the V02peai< prediction equation using 
the WFI criterion of 42 mL^kg'^^min'^ as the absolute 
minimal level of fitness for duty (i.e., V02peak) recom- 
mended for all firefighters, regardless of age and sex. This 
replication and confirmation of validity and accuracy of the 
equation is important as fire departments across the nation 
use the WFI protocol to predict V02peai< and fiirther 



require a minimal level of aerobic fitness in all firefighters 
as a requirement for employment or return to duty assign- 
ment. A lack of precision in the V02peai< prediction equa- 
tion could erroneously deny an individual firefighter from 
duty or place a firefighter on duty whose limited aerobic 
capacity may prevent them from appropriately carrying out 
demanding occupational duties and even present a risk for 
on-duty injury or cardiac death. 

Materials and methods 

Participants 

Participants were limited to currently employed and ac- 
tive line firefighters and Illinois Fire Service Institute 
(IFSI) field staff who were a) between the ages of 18 and 
60 years, b) cleared by their home department to partici- 
pate in live-fire activities, c) free from known cardiovas- 
cular disease (as determined by the Participant Activity 
Readiness Questionnaire (PAR-Q [21], d) with no history 
of neurological, gait or postural disorder, and e) not re- 
cently suffering an injury or surgery that results in gait 
or postural disruption. All firefighters provided informed 
consent and associated procedures were approved by the 
University of Illinois institutional review board. 

Equipment 
COSMED K4b2 

The COSMED K4b2 is a commercially available portable 
metabolic unit that measures oxygen consumption (VO2) 
and carbon dioxide production (VCO2) on a breath-by- 
breath basis (K4b2 Cosmed, Italy). The K4b2 portable unit 
and battery weigh about 1100 grams (-2.4 pounds) and is 
specifically designed to be worn by the subject during activ- 
ity [22]. The K4b2 uses an O2 and CO2 analyzer connected 
to a flowmeter with a bidirectional digital turbine. The 
flowmeter is attached to a rubber facemask (Hans-Rudolph, 
Kansas City, MO) that is placed to tightly cover the partici- 
pants mouth and nose. Although the K4b2 system is vali- 
dated for VO2 measurements over a wide range of 
exercise intensities [22], previous studies have demon- 
strated a repeatable pattern of overestimation [23,24] 
that can be corrected by using a validated regression 
equation [24]. Therefore, we applied the equation pro- 
posed by Duffield, et al. to the V02peak values measured by 
the K4b2 [24]. After a 30-minute warm-up, the O2 and 
CO2 analyzers of the K4b2 were calibrated using previ- 
ously verified concentrations of gases, and the flow meter 
was calibrated using a 3 L syringe (Hans Rudolph, Kansas 
City, MO). The K4b2 and battery were both placed in the 
standard shoulder harness that was secured with the K4b2 
resting on the chest and the battery on the upper back. 
This standard harness allows for minimal interference 
during ambulation on the treadmill. 
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Maximal exercise test procedure 

All maximal exercise tests were performed at the Illinois 
Fire Service Institute (IFSI) in Urbana-Champaign, IL. A 
research member initially measured the firefighter s height 
and weight using a standard weight scale and height rod. 
Each test began with 5 -minute period of data collection in 
the sitting position to allow for the collection of resting 
heart rate and oxygen consumption data. Participants then 
began walking on the treadmill for a 3-minute warm-up 
period. After the warm-up, firefighters completed the 
same Gerkin treadmill protocol used in the WFI sub- 
maximal assessment. However, the test was not terminated 
when firefighters reached 85% of estimated maximum heart 
rate, but rather was terminated when firefighters reached a 
maximal level of exertion. Verbal encouragement was pro- 
vided throughout the testing session by research staff to en- 
sure maximal effort. At each minute, heart rate and rating 
of perceived exertion (RPE) [25] were recorded. The Borg 
RPE scale was described to each participant prior to testing 
to allow for complete understanding and familiarization. 
The test was considered finished when the participant indi- 
cated volitional fatigue, and this coincided with a reported 
RPE >17. There were no other criteria for completion such 
as plateau of VO2. A cool down period then followed, con- 
sisting of walking at a comfortable speed and 0% grade. 
The highest 15-second average recording of VO2 by the 
COSMED K4b2 was considered V02peai<. 

Data analysis 

We initially calculated the predicted V02peak for each 
firefighter based on the WFI estimation equation [16]: 

Predicted V02peak {mL*kg~^*mm~^) 

= 56.981 + (1.242 x TT)-{0.805 x BMI). 

The test time (TT) wherein a firefighter reached 85% of 
estimated maximum heart rate (i.e., (208 - (0.7 x age)) x 
0.85) was based on the K4b2 15-second averaging data as 
the time when the participant reached the intended heart 
rate value for 15 seconds and did not further decrease dur- 
ing the remainder of the test. This test time was then 
inserted into the WFI equation, along with BMI, to calcu- 
late predicted V02peai<- We then calculated the corrected 
measured V02peak by applying Duffield et al. s regression 
equation to the recorded maximalV02peai< from the K4b2. 
This equation is as follows [23] : 

y02peak,Measured=^-^^^ {yO2peak,K4^b2-0. 227^ . 

All analyses were conducted using SPSS v. 21.0. De- 
scriptive statistics are presented as mean ± SD. Paired 
samples ^-tests with 2-tailed a of .05 were conducted for 
examining absolute mean differences in predicted vs. 
measured V02peak- We estimated the association be- 
tween the predicted and measured V02peak by using the 



Pearson product-moment correlation coefficient (r) and 
Spearman's p. The scatterplot along with line of best fit 
and 95% confidence intervals is provided in a figure to 
visually demonstrate the association between predicted 
and measured V02peai<- We estimated the intraclass cor- 
relation coefficient (ICC) between predicted and measured 
V02peai<- Linear regression analysis was conducted by 
regressing predicted V02peai< on measured V02peak in the 
entire sample to provide the R^ value for strength of asso- 
ciation and standard error of the estimate (SEE) as an indi- 
cation of precision. We produced a Bland- Altman plot of 
the difference between measured and predicted V02peak 
and mean of measured and predicted V02peak- We exam- 
ined the correlation of participant characteristics (for ex- 
ample, age, BMI, and measured fitness level) with the 
difference between predicted and measured V02peak- The 
classification accuracy of the V02peai< prediction equation 
was determined by identif)^ing if the predicted V02peak 
value demonstrated an underestimation, overestimation, 
correct pass, or correct fail when compared to the mea- 
sured V02peai<- Firefighters were classified according to the 
current aerobic fitness classification criterion used by the 
WFI and NFPA 1582 (42 mL4<g '^min ^). 

Results 

Sample characteristics 

The participants (n = 22) had an age range between 19 - 
43 years with a mean of 27.5 years ± 7.1. The mean ± SD 
height, weight, and BMI of the participants were 1.82 ± 
0.07 m, 89.6 ± 13.8 kg, and 27.1 ± 3.6 kg^m^'\ respectively. 

Descriptive statistics for measured and predicted V02peak 

The predicted V02peai< (43.72 ±3.60 mL4<g'^min'^) was 
not significantly {t = -.430, p = 0.672) different than the 
measured V02peai< (44.49 ± 8.72 mL4<g '^min'^). The differ- 
ence between predicted and measured V02peai< was -0.77 ± 
8.35 mL»kg'^»min'^. The intraclass correlation coefficient 
(ICC) between measured and predicted V02peai< was weak 
(ICC = 0.215, j9>0.05). Pearson (r = 0.31, p = 017) and 
Spearman {p = 0.28, p = 0.21) correlation coefficients dem- 
onstrated weak associations between predicted and mea- 
sured V02peal<. 

Regression analysis 

The scatterplot along with the 95% confidence limits of 
the association between measured V02peai< (independent 
variable) and predicted V02peai< (dependent variable) for 
the overall sample is provided in Figure 1. There was a 
weak, statistically non-significant association between 
measured V02peak ^nd predicted V02peai< (R^ = 0.09, F 
(1,21) = 2.05, p = 017). The lack of precision for predicted 
vs. measured V02peak is demonstrated in the standard 
error of the estimate (SEE = 8.5 mL^kg'^^min'^). 
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Figure 1 Scatterplot along with line of best fit and 95% confidence limits for the association between measured and 
predicted VOapeak- 



Bland-Altman plot 

The Bland-Altman plot in Figure 2 demonstrated variability 
in the prediction of V02peai< in the overall sample, but this 
variability was within ±2SDs of the mean value. However, 
the difference between predicted and measured V02peai< ap- 
peared to be directly related with the average V02peai< value. 



Correlates of inaccuracy 

There was a weak association between age and difference 
of predicted and measured V02peai< values (r = -.36, p = 
0.10; p = -.46, p < 0.05). There was a strong and significant 
association between fitness level (i.e., measured V02peak) 
and difference between predicted and measured V02peak 
values (r = 0.91, p < 0.05; p = 0.94, p < 0.05). 
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Figure 2 Bland-Altman plot of the difference between 
measured and predicted V02peak as a function of the mean of 
measured and predicted VOapeak the entire sample. The lines 
represent average +2SDS. 



Classification accuracy 

The V02peak prediction equation misclassified eight of 
the 22 firefighters (i.e., 36% of the sample) in comparison 
to measured V02peak values. Four participants' V02peaks 
were underestimated (-6.07 ± 0.95 mL^kg'^^min'^), and 
four participants' V02peai<s were overestimated (8.58 ± 
2.54 mL^kg'^^min"^), when using values calculated by 
the prediction equation. This is demonstrated below in 
Table 1. 

Discussion 

We found no mean difference between predicted and 
measured V02peai< at the overall group level, consistent 
with previous research [19]. We demonstrate a mean 
V02peai< difference of -0.77 mL^kg'^^min'^ and previous 
research reported a mean difference of 0.25 mL^kg'^^min'^ 
[19]. This might suggest that the 2008-revised WFI pre- 
diction equation is accurate. However, there was large 
error and disagreement in prediction at the individual 
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Table 1 Classification accuracy of the V02peak prediction equation 






Measured VOor,^,^ 




'Not Fit' 


'Fit for Duty' 




< 42 ml.kg \min'^ 


>42 ml.kg \min'^ 


'Not Fif 


N = 4 


N=4 


<42 ml.kg \min'^ 


Measured: 33.46 ±3.79 


Measured: 46.64 ±2.1 7 


Predicted VOapeak 

Tit for Duty' 


Predicted: 39.20 ±3.09 
N = 4 


Predicted: 40.57 ±1.51 
N = 10 


>42 ml.l<g \min'^ 


Measured: 37.39 ± 1.69 


Measured: 50.87 ±6.81 




Predicted: 45.97 ±1.79 


Predicted: 45.89 ±2.07 



level based on the SD of the mean difference between 
predicted and measured V02peak> standard error of the 
mean (SEM), and the Bland- Altman plot. Further, there 
was a weak association between predicted and measured 
V02peai< values based on correlation, ICC, and regression. 

Although the difference between predicted and mea- 
sured V02peai< was Small, the associated SD was ~ ± 
9 mL^kg'^^min'^. This SD demonstrates high variability in 
accuracy when using the V02peak prediction equation. The 
Bland-Altman plot demonstrates evidence of systematic 
error between measured and predicted V02peak as a func- 
tion of the mean of measured and predicted V02peak. 
Some of the data points approach ~ 2 SDs difference, re- 
vealing high variability in the accuracy of the prediction 
equation based on firefighters baseline fitness level. Fur- 
ther, both Pearson and Spearman correlation coefficients 
demonstrated weak associations between predicted and 
measured V02peak- The small ICC between measured and 
predicted V02peak (ICC = 0.215) highlights disagreement 
between measured and predicted values in such that par- 
ticipants' rank of V02peak greatly differed depending on 
the measured or predicted value. Lastly, the statistically 
non-significant association between measured V02peai< 
and predicted V02peak is demonstrated in the linear re- 
gression, and the lack of precision for predicted vs. mea- 
sured V02peak Is Verified in the large standard error of the 
estimate (SEE = 8.5 mL^kg'^^min'^). This lack of relation- 
ship does not appear to be related to a truncated range of 
V02peai< values, as our data indicate a measured V02peak 
range of 34 mL»kg'^»min'^. 

The data analysis identified specific correlates of inaccur- 
acy for the difference between predicted and measured 
V02peai< values. We demonstrated age of the firefighters to 
be related to inaccuracy of the prediction such that the 
V02peai< of older firefighters was recurrently overestimated 
and the V02peai< of younger firefighters was underestimated 
when compared to measured V02peak values. This high- 
lights a problem with the prediction equation for V02peai< 
as fire departments in the United States employ men and 
women of wide age range, with the majority being younger 
than 50 years of age [26]. The prediction equation also 



consistently overestimated fitness in firefighters with a 
lower baseline fitness level (i.e., V02peai<) and underesti- 
mated firefighters' fitness in those with a higher baseline 
fitness level. This association is further demonstrated in 
the Bland-Altman plot. Therefore, these inaccuracies may 
restrict younger firefighters with sufficient V02peak from 
being placed on duty. Importantly, together these corre- 
lates suggest the highest risk for overestimating fitness lies 
in older, less fit firefighters; the group that is at the highest 
risk for sudden cardiac events. Further, when classifying 
firefighters as fit for duty according to the WFI criterion 
(V02peak ^ 42 mL»kg'^»min'^), the predicted V02peai< values 
calculated from the estimation equation would misclassify 
eight firefighters, overestimating four and underestimating 
four firefighters actual V02peak (i-e., 36% of our sample 
would be misclassified). Therefore, four firefighters would 
be placed on duty with limited aerobic capacity, potentially 
increasing risk for inability to complete duty assignment or 
more importantly, on-duty injury or death. On the con- 
trary, four firefighters with a suitable V02peai< (i-^-, 
V02peak ^ 42 mL^kg'^^miu'^) may be restricted from duty 
due to inaccurate V02peak predictions. 

We did not have large enough sample for generating a 
new estimation equation, and this could be the focus of fu- 
ture research. One such revision might be to utilize a 
measure other than BMI in the estimation equation, since 
younger, resistance-trained participants with more lean 
muscle mass may have a higher BMI value, although the 
individual firefighter is not obese. Therefore, the use of 
body composition or girth might provide a more accurate 
estimation. 

Strengths and limitations 

The strengths of our study include the wide age range of 
participants (range = 19-43 years) and the same research 
team and equipment conducted all tests for increased test 
consistency and inter-rater reliability. Further, each max- 
imal exercise test was conducted by trained personnel 
with years of experience conducting maximal exercise 
tests with the COSMED K4b2 portable metabolic unit. Al- 
though this study has many strengths, it is not without 
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limitations. A study limitation includes the small, predom- 
inantly male (n = 21) and Caucasian (n = 22) sample size. 
However, the US Department of Labor reported in 2002 
that female firefighters accounted for 3.7% of all persons 
in the occupation [27], which provides rationale for the 
mostly male sample. Nevertheless, the findings of this 
study should be replicated in a different, larger sample of 
firefighters. 

Conclusion 

This study demonstrates no overall or mean level inaccur- 
acy for the 2008-revised WFI V02peai< prediction equation 
compared with measured V02peak for the entire sample. 
However, we did demonstrate inaccuracy and variability in 
the estimation equation as a function of individual charac- 
teristics, particularly baseline fitness level and age of the 
firefighters. We further indicate that based on the WFI cri- 
terion minimum V02peak of 42 mL»kg'^»min'^, 36% of our 
sample of firefighters would be misclassified in terms of 
"fitness for duty". These results suggest that the currently 
utilized prediction equation may need to be reevaluated as 
a means of precisely determining fitness for individual 
firefighters, which may affect employment status, duty as- 
signment, and overall life safety of the firefighter. The 
need to accurately assess fitness for duty in the Fire Ser- 
vice is well documented and well founded, so continued 
development of a validated, accurate and precise fitness 
test is strongly encouraged. 
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